SEQUENCE LISTING 



<110> Cannon, Paul David 

Sankuratri, Suryanarayana 

<120> Human Intestinal Npt2B 



<130> ROCH-001 



<150> 60/119,321 
<151> 1999-02-09 



<160> 2 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 689 
<212> PRT 
<213> human 



<400> 1 



Met 


Ala 


Pro 


Trp 


Pro 


Glu 


Leu 


Gly 


1 








5 








Tyr 


Leu 


Glu 


Gly 


Ala 


Ala 


Gly 


Gin 


















Lys 


Glu 


Thr 


Asn 


Lys 


Asn 


Asn 


Thr 






35 










40 


Leu 


Leu 


Pro 


Ser 


Tyr 


Ser 


Thr 


Ala 




50 










55 




Val 


Asp 


Asp 


Pro 


Trp 


Asn 


Leu 


Pro 


65 










70 






Trp 


Ser 


Glu 


Arg 


Asp 


Thr 


Lys 


Gly 










85 








He 


Gly 


Arg 


Leu 


He 


Leu 


Leu 


Leu 








100 










Ser 


Leu 


Asp 


He 


Leu 


Ser 


Ser 


Ala 






115 










120 


Ala 


Gly 


Gin 


Phe 


Phe 


Ser 


Asn 


Ser 




130 










135 




Gly 


Leu 


Val 


He 


Gly 


Val 


Leu 


Val 


145 










150 






Thr 


Ser 


Thr 


Ser 


He 


Val 


Val 


Ser 










165 








Val 


Arg 


Ala 


Ala 


He 


Pro 


He 


He 








180 










He 


Thr 


Asn 


Thr 


lie 


Val 


Ala 


Leu 






195 










200 


Phe 


Arg 


Arg 


Ala 


Phe 


Ala 


Gly 


Ala 




210 










215 




Leu 


Ser 


Val 


Leu 


Val 


Leu 


Leu 


Pro 


225 










230 






Glu 


He 


He 


Thr 


Gin 


Leu 


He 


Val 










245 








Glu 


Asp 


Ala 


Pro 


Asp 


Leu 


Leu 


Lys 



Asp 


Ala 


Gin 


Pro 


Asn 


Pro 


Asp 


Lys 




10 










15 




Gin 


Pro 


Thr 


Ala 


Pro 


Asp 


Lys 


Ser 


ZD 










^n 






Glu 


Ala 


Pro 


Val 


Thr 


Lys 


He 


Glu 










45 








Thr 


Leu 


He 


Asp 


Glu 


Pro 


Thr 


Glu 








60 










Thr 


Leu 


Gin 


Asp 


Ser 


Gly 


He 


Lys 






75 










80 


Lys 


He 


Leu 


Cys 


Phe 


Phe 


Gin 


Gly 




90 










95 




Gly 


Phe 


Leu 


Tyr 


Phe 


Phe 


Val 


Cys 


105 










110 






Phe 


Gin 


Leu 


Val 


Gly 


Gly 


Lys 


Met 










125 








Ser 


He 


Met 


Ser 


Asn 


Pro 


Leu 


Leu 








140 










Thr 


Val 


Leu 


Val 


Gin 


Ser 


Ser 


Ser 






155 










160 


Met 


Val 


Ser 


Ser 


Ser 


Leu 


Leu 


Thr 




170 










175 




Met 


Gly 


Ala 


Asn 


He 


Gly 


Thr 


Ser 


185 










190 






Met 


Gin 


Val 


Gly 


Asp 


Arg 


Ser 


Glu 










205 








Thr 


Val 


His 


Asp 


Phe 


Phe 


Asn 


Trp 








220 










Val 


Glu 


Val 


Ala 


Thr 


His 


Tyr 


Leu 






235 










240 


Glu 


Ser 


Phe 


His 


Phe 


Lys 


Asn 


Gly 




250 










255 




Val 


He 


Thr 


Lys 


Pro 


Phe 


Thr 


Lys 



1 





















265 










270 






Leu 


lie 


Val 


Gin 


Leu 


Asp 


Lys 


Lys 
280 


Val 


He 


Ser 


Gin 


He 
285 


Ala 


Met Asn 




Asp 


Glu 


Lys 


Ala 


Lys 


Asn 


Lys 


Ser 


Leu 


Val 


Lys 


He 


Trp 


Cys 


Lys Thr 




c j<j 










295 










300 










rue 


Thr 
1 111 


Asn 


Lys 


Thr 


Gin 


He 


Asn 


Val 


Thr 


Val 


Pro 


Ser 


Thr 


Ala Asn 




JU j 








310 










315 








320 




Cys 


Th r 

inr 


Ser 


Pro 
r 1 u 


Ser 


Leu 


Cys 


up 


Thr 


Asp 


Glv 


He 


Gin 


Asn 


Trp Thr 




















330 










335 




Met 


Lys 


Asn 


Val 


Thr 


Tyr 


Lys 


Glu 


Asn 


He 


Ala 


Lys 


Cys 


Gin 


His He 


















345 










350 






Phe 


Val 


Asn 

ODD 


Phe 


His 


Leu 


Pro 


Asp 
joy 


Leu 


Ala 


Val 


Gly 


Thr 
365 


He 


Leu Leu 




lie 


Leu 

■J (u 


ber 


Leu 


Leu 


vol 


Leu 
j / j 




ftl V 




Leu 


He 
380 


Met 


He 


Val Lys 




He 


Leu 


Gly 


Ser 


val 


Leu 


Lys 


biy 


uin 


v ai 


Al a 
nld 


Thr 
i til 


Val 
val 


He 


TiV<? Lv*5 

jjy 0 i-J y 0 




385 








oyu 


















400 


?~ 
Q 


Thr 


He 


Asn 


inr 


Asp 


rne 


Dvrt 

rro 


Pho 

rne 


rro 


Pho 
rne 


Al a 
rVla 


irp 


Leu 


Thr 


Glv Tvr 
uiy iyi 










/AC 

4UD 










4 1 n 

4 1U 










415 


Q 


Leu 


Aia 


lie 


Leu 
4zu 


vai 


laiy 


Al a 
rvla 


v»iy 


4&D 


Thr 


Phe 


He 


Val 


Gin 
430 


Ser Ser 


U 

a s 

: j 


Ser 


Val 


Pne 
4 Jo 


m l~ v 

inr 


Car 

ber 


ai a 
Aia 


Leu 


inr 
4 4 n 

44U 


rro 


Leu 


T 1 P 
lie 


vjiy 


He 
445 


Glv 

vjiy 


Val He 


fft 


Thr 


lie 


Glu 


Arg 


Aia 


Tyr 


Pro 


Leu 


Thr 

i nr 


Leu 


ft! v 
uiy 


Ser 


Asn 


He 


Gly Thr 


45 




4o(J 










4 Cs t% 

4 DO 










460 








Thr 


Thr 


Thr 


Ala 


Ti A 

1 le 


Leu 


Ala 


i\±a 


Leu 


Al a 
nld 


OCi 


Prn 
r 1 vj 


Gly 


Asn 


Ala Leu 




465 










4 /u 










47^ 








480 




Arg 


Ser 


Ser 


Leu 


Gin 


He 


Ala 


Leu 


Cys 


His 


Phe 


Phe 


Phe 


Asn 


He Ser 


\& 








4 0 D 










a on 
4 yu 










4 95 


gass 


Gly 


He 


Leu 


Leu 


Trp 


Tyr 


Pro 


lie 


Pro 


Pho 
rne 


Thr 


Arg 


Leu 


Pro 


He Arg 






500 










DUO 










510 




' "i 


Met 


Ala 


Lys 


Gly 


Leu 


biy 


Asn 


lie 


oer 


al a 

nld 


Lys 


T\/r 
lyr 


Arg 


Trp 


Phe Ala 








CI t 

d1d 




























val 


Pne 


Tyr 


Leu 


lie 


lie 


Phe 


Pho 

rne 


Pho 

rne 


Xj6U 


Tl p 
1 ie 


Pro 


Leu 


Thr 


Val Phe 






530 








D OD 










04 U 










Gly 


Leu 


ber 


Leu 


Aia 


uiy 


Trp 


Arg 


Va 1 

val 


T\on 


Val 


Glv 
uiy 


Val 


Gly 


Val Pro 




545 










ccn 










jjj 








560 




val 


Val 


Phe 


lie 


lie 


1 le 


Leu 


V dl 


Leu 


*w»y 0 
0 / U 


Leu 


Arg 


Leu 


Leu 


Gin Ser 
575 




Arg 


Cys 


Pro 


Arg 


vai 


Leu 


Pro 


Lys 


Lys 


Leu 


f41 n 


Asn 


Trn 


Asn 


Phe Leu 






DoU 










DOD 










'590' 






Pro 


Leu 


Trp 


Met 


Arg 


Ser 


Leu 


Lys 


Prr> 


Trp 


ASp 


Ala 


Val 


Val Ser Lys 




























605 








Phe 


Thr 


Gly 


Cys 


Phe 


Gin 


Met 


Arg 


Cys 


Cys 


Cys 


Cys 


Cys 


Arg Val Cys 






610 








615 










620 










Cys 


Arg 


Ala 


Cys 


Cys 


Leu 


Leu 


Cys 


Gly 


Cys 


Pro 


Lys 


Cys 


Cys Arg Cys 




625 








630 










635 








640 




Ser 


Lys 


Cys 


Cys 


Glu 


Asp 


Leu 


Glu 


Glu 


Ala 


Gin 


Glu 


Gly 


Gin Asp Val 






645 










650 










655 




Pro 


Val 


Lys 


Ala 


Pro 


Glu 


Thr 


Phe 


Asp 


Asn 


He 


Thr 


He 


Ser Arg Glu 








660 










665 










670 






Ala 


Gin 


Gly 


Glu 


Val 


Pro 


Ala 


Ser 


Asp 


Ser 


Lys 


Thr 


Glu 


Cys Thr Ala 








675 










630 










685 







Leu 



<210> 2 



2 



<211> 4137 
<212> DNA 
<213> human 

<400> 2 

ctgacgta.gg cccagcacct gcggagggag cgctgaccat ggctccctgg cctgaattgg 60 

gagatgccca gcccaacccc gataagtacc tcgaaggggc cgcaggtcag cagcccactg 120 

cccctgataa aagcaaagag accaacaaaa ataacactga ggcacctgta accaagattg 180 

aacttctgcc gtcctactcc acggctacac tgatagatga gcccactgag gtggatgacc 240 

cctggaacct acccactctt caggactcgg ggatcaagtg gtcagagaga gacaccaaag 300 

ggaagattct ctgtttcjttc caagggattg ggagattgat tttacttctc ggatttctct 360 

actttttcgt gtgctccctg gatattctta gtagcgcctt ccagctggtt ggaggaaaaa 420 

tggcaggaca gttcttcagc aacagctcta ttatgtccaa ccctttgttg gggctggtga 480 

tcggggtgct ggtgaccgtc ttggtgcaga gctccagcac ctcaacgtcc atcgttgtca 540 

gcatggtgtc ctcttcattg ctcactgttc gggctgccat ccccattatc atgggggcca 600 

acattggaac gtcaatcacc aacactattg ttgcgctcat gcaggtggga gatcggagtg 660 

agttcagaag agcttttgca ggagccactg tccatgactt cttcaactgg ctgtccgtgt 720 

U tggtgctctt gcccgtggag gtggccaccc attacctcga gatcataacc cagcttatag 780 

tggagagctt ccacttcaag aatggagaag atgccccaga tcttctgaaa gtcatcacta 840 

agcccttcac aaagctcatt gtccagctgg ataaaaaagt tatcagccaa attgcaatga 900 

?Z acgatgaaaa agcgaaaaac aagagtcttg tcaagatttg gtgcaaaact tttaccaaca 960 

H! agacccagat taacgtcact gttccctcga ctgctaactg cacctcccct tccctctgtt 1020 

[U ggacggatgg catccaaaac tggaccatga agaatgtgac ctacaaggag aacatcgcca 1080 

CP aatgccagca tatctttgtg aatttccacc tcccggatct tgctgtgggc accatcttgc 1140 

OH tcatactctc cctgctggtc ctctgtggtt gcctgatcat gattgtcaag atcctgggct 1200 

-E ctgtgctcaa ggggcaggtc gccactgtca tcaagaagac catcaacact gatttcccct 1260 

h ttccctttgc atggttgact ggctacctgg ccatcctcgt cggggcaggc atgaccttca 1320 

H tcgtacagag cagctctgtg ttcacgtcgg ccttgacccc cctgattgga atcggcgtga 1380 

fl taaccattga gagggcttat ccactcacgc tgggctccaa catcggcacc accaccaccg 1440 

f7 ccatcctggc cgccttagcc agccctggca atgcattgag gagttcactc cagatcgccc 1500 

tgtgccactt tttcttcaac atctccggca tcttgctgtg gtacccgatc ccgttcactc 1560 

gcctgcccat ccgcatggcc aaggggctgg gcaacatctc tgccaagtat . cgctggttcg 1620 

Q ccgtcttcta cctgatcatc ttcttcttcc tgatcccgct gacggtgttt ggcctctcgc 1680 

RJ tggccggctg gcgggtgctg gttggtgtcg gggttcccgt cgtcttcatc atcatcctgg 1740 

tactgtgcct ccgactcctg cagtctcgct gcccacgcgt cctgccgaag aaactccaga 1800 

actggaactt cctgccgctg tggatgcgct cgctgaagcc ctgggatgcc gtcgtctcca 1860 

agttcaccgg ctgcttccag atgcgctgct gctgctgctg ccgcgtgtgc tgccgcgcgt 1920 

gctgcttgct gtgtggctgc cccaagtgct gccgctgcag caagtgctgc gaggacttgg - 1980 

aggaggcgca ggaggggcag gatgtccctg tcaaggctcc tgagaccttt gataacataa 2040 

ccattagcag agaggctcag ggtgaggtcc ctgcctcgga ctcaaagacc gaatgcacgg 2100 

" ccttgtaggg gacgccc'cag attgtcaggg atggg^gat ggtccttgag ttttgcatgc 2160 

tctcctccct cccacttctg caccctttca ccacctcgag gagatttgct cccca'ttagc 2220 

gaatgaaatt gatgcagtcc tacctaactc gattcccttt ggcttggtgg gtaggcctgc 2280 

agggcacttt tattccaacc cctggtcact cagtaatctt ttactccagg aaggcacagg 2340 

atggtaccta aagagaatta gagaatgaac ctggcgggac ggatgtctaa tcctgcacct 24 00 

agctgggttg gtcagtagaa cctattttca gactcaaaaa ccatcttcag aaagaaaagg 24 60 

cccagggaag gaatgtatga gaggctctcc cagatgagga agtgtactct ctatgactat 2520 

caagctcagg cctctccctt tttttaaacc aaagtctggc aaccaagagc agcagctcca 2580 

tggcctcctt gccccagatc agcctgggtc aggggacata gtgtcattgt ttggaaactg 2640 

cagaccacaa ggtgtgggtc tatcccactt cctagtgctc cccacattcc ccatcagggc 2700 

ttcctcacgt ggacaggtgt gctagtccag gcagttcact tgcagtttcc ttgtcctcat 27 60 

gcttcgggga tgggagccac gcctgaacta gagttcaggc tggatacatg tgctcacctg 2820 

ctgctcttgt cttcctaaga gacagagagt ggggcagatg gaggagaaga aagtgaggaa 2880 

tgagtagcat agcattctgc caaaagggcc ccagattctt aatttagcaa actaagaagc 2940 

ccaattcaaa agcattgtgg ctaaagtcta acgctcctct cttggtcaga taacaaaagc 3000 

cctccctgtt ggatcttttg aaataaaacg tgcaagttat ccaggctcgt agcctgcatg 3060 

ctgccacctt gaatcccagg gagtatctgc acctggaata qctctccacc cctctctgcc 3120 



3 



ry 

Jul E 

£ 

S 

Q 
u 

M 
M 




tccttacttt ctgtgcaaga tgacttcctg ggttaacttc cttctttcca tccacccacc 3180 

cactggaatc tctttccaaa catttttcca ttttcccaca gatgggcttt gattagctgt 3240 

cctctctcca tgcctgcaaa gctccagatt tttggggaaa gctgtaccca actggactgc 3300 

ccagtgaact gggatcattg agtacagtcg agcacacgtg tgtgcatggg tcaaaggggt 3360 

gtgttccttc tcatcctaga tgccttctct gtgccttcca cagcctcctg cctgattaca 3420 

ccactgcccc cgccccaccc tcagccatcc caattcttcc tggccagtgc gctccagcct 3480 

tatctaggaa aggaggagtg ggtgtagccg tgcagcaaga ttggggcctc ccccatccca 3540 

gcttctccac catcccagca agtcaggata tcagacagtc ctcccctgac cctccccctt 3600 

gtagatatca attcccaaac agagccaaat actctatatc tatagtcaca gccctgtaca 3660 

gcatttttca taagttatat agtaaatggt cttctagtgc tctcatttgg aaatgaggca 3720 

ggcttcttct atgaaatgta aagaaagaaa ccactttgta tattttgtaa taccacctct 3780 

gtggccatgc ctgccccgcc cactctgtat atatgtaagt taaacccggg caggggctgt 3840 

ggccgtcttt gtactctggt gatttttaga aattgaatct ttgtacttgc attgattgta 3900 

taataatttt gagaccaggt ctcgctgtgt tgctcaggct ggtctcaaac tcctgagatc 3960 

aagcaatccg cccacctcag cctcccaaag tgctgagatc acaggcgtga gccaccacca 4020 

ggcctgattg taattttttt tttttttttt tttactggtt atgggaaggg agaaataaaa 4080 

tcatcaaacc caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 4137 
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FIG. 1 
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1 MAPWPELGDA QPNPDKYLEG AAGQQPTAPD KSKETNKNNT EAPVTKIELL 

51 PSYSTATLID EPTEVDDPWN LPTLQDSGIK WSERDTKGKI LCFFQGIGRL 

101 ILLLGFLYFF VCSLDILSSA FQLVGGKMAG QFFSNSSIMS NPLLGLVIGV 

151 LVTVLVQSSS TSTSIVVSMV SSSLLTVRAA IPIIMGANIG TSITNTIVAL 

201 MQVGDRSEFR RAFAGATVHD FFNWLSVLVL LPVEVATHYL EIITQLIVES 

251 FHFKNGEDAP DLLKVITKPF TKLIVQLDKK VISQIAMNDE KAKNKSLVKI 

301 WCKTFTNKTQ INVTVPSTAN CTSPSLCWTD GIQNWTMKNV TYKENIAKCQ 

351 HIFVNFHLPD LAVGTILLIL SLLVLCGCLI MIVKILGSVL KGQVATVIKK 

401 TINTDFPFPF AWLTGYLAIL VGAGMTFIVQ SSSVFTSALT PLIGIGVITI 

4 51 ERAYPLTLGS NIGTTTTAIL AALASPGNAL RSSLQIALCH FFFNISGILL 

501 WYPIPFTRLP IRMAKGLGNI SAKYRWFAVF YLIIFFFLIP LTVFGLSLAG 

551 WRVLVGVGVP WFIIILVLC LRLLQSRCPR VLPKKLQNWN FLPLWMRSLK 

601 PWDAWSKFT GCFQMRCCCC CRVCCRACCL LCGCPKCCRC SKCCEDLEEA 

651 QEGQDVPVKA PETFDNITIS REAQGEVPAS DSKTECTAL* (SEQ ID NO: 01) 
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FIG. 2i 





1 


CTGACGTAGG 


CCCAGCACCT 




51 


CCTGAATTGG 


GAGATGCCCA 




101 


CGCAGGTCAG 


CAGCCCACTG 




151 


ATAACACTGA 


GGCACCTGTA 




201 


ACGGCTACAC 


TGATAGATGA 




251 


ACCCACTCTT 


CAGGACTCGG 




301 


GGAAGATTCT 


CTGTTTCTTC 




351 


GGATTTCTCT 


ACTTTTTCGT 




401 


CCAGCTGGTT 


GGAGGAAAAA 




451. 


TTATGTCCAA 


CCCTTTGTTG 




501 


TTGGTGCAGA 


GCTCCAGCAC 




551 


CTCTTCATTG 


CTCACTGTTC 




601 


ACATTGGAAC 


GTCAATCACC 




651 


GATCGGAGTG 


AGTTCAGAAG 


^>~~ 


701 


CTTCAACTGG 


CTGTCCGTGT 




751 


ATTACCTCGA 


GATCATAAGC 


[iff 


801 


AATGGAGAAG 


ATGCCCCAGA 




851 


AAAGCTCATT 


GTCCAGCTGG 




901 


ACGATGAAAA 


AGCGAAAAAC 


y = 


951 


TTTACCAACA 


AGACCCAGAT 


GO 


1001 


CACCTCCCCT 


TCCCTCTGTT 


* 


1051 


AGAATGTGAC 


CTACAAGGAG 


E 


1101 


AATTTCCACC 


TCCCGGATCT 




1151 


CCTGCTGGTC 


CTCTGTGGTT 




1201 


CTGTGCTCAA 


GGGGCAGGTC 


SS.SSS 


1251 


GATTTCCCCT 


TTCCCTTTGC 


SJ 
fl 


1301 


CGGGGCAGGC 


ATGACCTTCA 


1351 


CCTTGACCCC 


CCTGATTGGA 


Rl 8 


1401 


CCACTCACGC 


TGGGCTCCAA 


14 


1451 


CGCCTTAGCC 


AGCCCTGGCA 




1501 


TGTGCCACTT 


TTTCTTCAAC 




1551 


CCGTTCACTC 


GCCTGCCCAT 




1601 


TGCCAAGTAT 


CGCTGGTTCG 




1651 


TGATCCCGCT 


GACGGTGTTT 




1701 


GTTGGTGTCG 


GGGTTCCCGT 




1751 


CCGACTCCTG 


CAGTCTCGCT 




1801 


ACTGGAACTT 


CCTGCCGCTG 




1851 


GTCGTCTCCA 


AGTTCACCGG 




1901 


CCGCGTGTGC 


TGCCGCGCGT 




1951 


GCCGCTGCAG 


CAAGTGCTGC 




2001 


GATGTCCCTG 


TCAAGGCTCC 




2051 


AGAGGCTCAG 


GGTGAGGTCC 




2101 


CCTTGTAGGG 


GACGCCCCAG 




2151 


TTTTGCATGC 


TCTCCTCCCT 




2201 


GAGATTTGCT 


CCCCATTAGC 




2251 


GATTCCCTTT 


GGCTTGGTGG 




2301 


CCTGGTCACT 


CAGTAATCTT 




2351 


AAGAGAATTA 


GAGAATGAAC 




2401 


AGCTGGGTTG 


GTCAGTAGAA 




2451 


AAAGAAAAGG 


CCCAGGGAAG 




2501 


AGTGTACTCT 


CTATGACTAT 




2551 


AAAGTCTGGC 


AACCAAGAGC 




2601 


AGCCTGGGTC 


AGGGGACATA 



GCGGAGGGAG 
GCCCAACCCC 
CCCCTGATAA 
ACCAAGATTG 
GCCCACTGAG 
GGATCAAGTG 
CAAGGGATTG 
GTGCTCCCTG 
TGGCAGGACA 
GGGCTGGTGA 
CTCAACGTCC 
GGGCTGCCAT 
AACAGTATTG 
AGCTTTTGCA 
TGGTGCTCTT 
CAGCTTATAG 
TCTTCTGAAA 
ATAAAAAAGT 
AAGAGTCTTG 
TAACGTCACT 
GGACGGATGG 
AACATCGCCA 
TGCTGTGGGC 
GCCTGATCAT 
GCCACTGTCA 
ATGGTTGACT 
TCGTACAGAG 
ATCGGCGTGA 
CATCGGCACC 
ATGCATTGAG 
ATCTCCGGCA 
CCGCATGGCC 
CCGTCTTCTA 
GGCCTCTCGC 
CGTCTTCATC 
GCCCACGCGT 
TGGATGCGCT 
CTGCTTCCAG 
GCTGCTTGCT 
GAGGACTTGG 
TGAGACCTTT 
CTGCCTCGGA 
ATTGTCAGGG 
CCCACTTCTG 
GAATGAAATT 
GTAGGCCTGC 
TTACTCCAGG 
CTGGCGGGAC 
CCTATTTTCA 
GAATGTATGA 
CAAGCTCAGG 
AGCAGCTCCA 
GTGTCATTGT 



CGCTGACCAT 
GATAAGTACC 
AAGCAAAGAG 
AACTTCTGCC 
GTGGATGACC 
GTCAGAGAGA 
GGAGATTGAT 
GATATTCTTA 
GTTCTTCAGC 
TCGGGGTGCT 
ATCGTTGTCA 
CCCCATTATC 
TTGCGCTCAT 
GGAGCCACTG 
GCCCGTGGAG 
TGGAGAGCTT 
GTCATCACTA 
TATCAGCCAA 
TCAAGATTTG 
GTTCCCTCGA 
CATCCAAAAC 
AATGCCAGCA 
ACCATCTTGC 
GATTGTCAAG 
TCAAGAAGAC 
GGCTACCTGG 
CAGCTCTGTG 
TAACCATTGA 
ACCACCACCG 
GAGTTCACTC 
TCTTGCTGTG 
AAGGGGCTGG 
CCTGATCATC 
TGGCCGGCTG 
ATCATCCTGG 
CCTGCCGAAG 
CGCTGAAGCC 
ATGCGCTGCT 
GTGTGGCTGC 
AGGAGGCGCA 
GATAACATAA 
CTCAAAGACC 
ATGGGGGGAT 
CACCCTTTCA 
GATGCAGTCC 
AGGGCACTTT 
AAGGCACAGG 
GGATGTCTAA 
GACTCAAAAA 
GAGGCTCTCC 
CCTCTCCCTT 
TGGCCTCCTT 
TTGGAAACTG 



GGCTCCCTGG 

TCGAAGGGGC 

ACCAACAAAA 

GTCCTACTCC 

CCTGGAACCT 

GACACCAAAG 

TTTACTTCTC 

GTAGCGCCTT 

AACAGCTCTA 

GGTGACCGTC 

GCATGGTGTC 

ATGGGGGCCA 

GCAGGTGGGA 

TCCATGACTT 

GTGGCCACCC 

CCACTTCAAG 

AGCCCTTCAC 

ATTGCAATGA 

GTGCAAAACT 

CTGCTAACTG 

TGGACCATGA 

TATCTTTGTG 

TCATACTCTC 

ATCCTGGGCT 

CATCAACACT 

CCATCCTCGT 

TTCACGTCGG 

GAGGGCTTAT 

CCATCCTGGC 

CAGATCGCCC 

GTACCCGATC 

GCAACATCTC 

TTCTTCTTCC 

GCGGGTGCTG 

TACTGTGCCT 

AAACTCCAGA 

CTGGGATGCC 

GCTGCTGCTG 

CCCAAGTGCT 

GGAGGGGCAG 

CCATTAGCAG 

GAATGCACGG 

GGTCCTTGAG 

CCACCTCGAG 

TACCTAACTC 

TATTCCAACC 

ATGGTACCTA 

TCCTGCACCT 

CCATCTTCAG 

CAGATGAGGA 

TTTTTAAACC 

GCCCCAGATC 

CAGACCACAA 
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2651 GGTGTGGGTC TATCCCACTT CCTAGTGCTC CCCACATTCC CCATCAGGGC 
2701 TTCCTCACGT GGACAGGTGT GCTAGTCCAG GCAGTTCACT TGCAGTTTCC 
2751 TTGTCCTCAT GCTTCGGGGA TGGGAGCCAC GCCTGAACTA GAGTTCAGGC 
2801 TGGATACATG TGCTCACCTG CTGCTCTTGT CTTCCTAAGA G AC AG AG AG T 
2851 GGGGCAGATG GAGGAGAAGA AAGTGAGGAA TGAGTAGCAT AGCATTCTGC 
2901 CAAAAGGGCC CCAGATTCTT AATTTAGCAA ACTAAGAAGC CCAATTCAAA 
2951 AGCATTGTGG CTAAAGTCTA ACGCTCCTCT CTTGGTCAGA TAACAAAAGC 
3001 CCTCCCTGTT GGATCTTTTG AAATAAAACG TGCAAGTTAT CCAGGCTCGT 
3051 AGCCTGCATG CTGCCACCTT GAATCCCAGG GAGTATCTGC ACCTGGAATA 
3101 GCTCTCCACC CCTCTCTGCC TCCTTACTTT CTGTGCAAGA TGACTTCCTG 
3151 GGTTAACTTC CTTCTTTCCA TCCACCCACC CACTGGAATC TCTTTCCAAA 
3201 CATTTTTCCA TTTTCCCACA GATGGGCTTT GATTAGCTGT CCTCTCTCCA 
3251 TGCCTGCAAA GCTCCAGATT TTTGGGGAAA GCTGTACCCA ACTGGACTGC 
3301 CCAGTGAACT GGGATCATTG AGTACAGTCG AGCACACGTG TGTGCATGGG 
3351 TCAAAGGGGT GTGTTCCTTC TCATCCTAGA TGCCTTCTCT GTGCCTTCCA 
3401 CAGCCTCCTG CCTGATTACA CCACTGCCCC CGCCCCACCC TCAGCCATCC 
3451 CAATTCTTCC TGGCCAGTGC GCTCCAGCCT TATCTAGGAA AGGAGGAGTG 
3501 GGTGTAGCCG TGCAGCAAGA TTGGGGCCTC CCCCATCCCA GCTTCTCCAC 
3551 CATCCCAGCA AGTCAGGATA TCAGACAGTC CTCCCCTGAC CCTCCCCCTT 
3601 GTAGATATCA ATTCCCAAAC AGAGCCAAAT ACTCTATATC TATAGTCACA 
3651 GCCCTGTACA GCATTTTTCA TAAGTTATAT AGTAAATGGT CTTCTAGTGC 
3701 TCTCATTTGG AAATGAGGCA GGCTTCTTCT ATGAAATGTA AAGAAAGAAA 
3751 CCACTTTGTA TATTTTGTAA TACCACCTCT GTGGCCATGC CTGCCCCGCC 
3801 CACTCTGTAT ATATGTAAGT TAAACCCGGG CAGGGGCTGT GGCCGTCTTT 
3851 GTACTCTGGT GATTTTTAGA AATTGAATCT TTGTACTTGC ATTGATTGTA 
3901 TAATAATTTT GAGACCAGGT CTCGCTGTGT TGCTCAGGCT GGTCTCAAAC 
3951 TCCTGAGATC AAGCAATCCG CCCACCTCAG CCTCCCAAAG TGCTGAGATC 
4001 ACAGGCGTGA GCCACCACCA GGCCTGATTG TAATTTTTTT TTTTTTTTTT 
4051 TTTACTGGTT ATGGGAAGGG AGAAATAAAA TCATCAAACC CAAAAAAAAA 
4101 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA (SEQ ID NO:02) 



